Search CORE

ART

Structural motifs recurring in different folds recognize the same ligand fragments

Author: A Brakoulias
A Shulman-Peleg
A Via
AE Todd
AG Murzin
AN Lupas
ATR Laurie
BL Roth
BN Chaudhuri
D Devos
DJ Peet
E Michalsky
Elena Gatti
F Diella
F Ferre
G Ausiello
G Ausiello
G Ausiello
G Wolber
Gabriele Ausiello
GO Reznik
HE Xu
I Nobeli
J Ruppert
JD Watson
JD Westbrook
JW Torrance
K Kinoshita
K Shah
KA Denessiouk
L Malinina
M Keil
M Nayal
M Silberstein
Manuela Helmer-Citterich
MJ Coon
N Kobayashi
ND Gold
Ottaviano Incani
PF Gherardini
PF Gherardini
Pier Federico Gherardini
PJ Simpson
RJ Najmanovich
RT Koehler
S Akira
S Schmitt
S Velankar
V Cappello
V Nahoum
Z Weng
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The structural analysis of protein ligand binding sites can provide information relevant for assigning functions to unknown proteins, to guide the drug discovery process and to infer relations among distant protein folds. Previous approaches to the comparative analysis of binding pockets have usually been focused either on the ligand or the protein component. Even though several useful observations have been made with these approaches they both have limitations. In the former case the analysis is restricted to binding pockets interacting with similar ligands, while in the latter it is difficult to systematically check whether the observed structural similarities have a functional significance. Results Here we propose a novel methodology that takes into account the structure of both the binding pocket and the ligand. We first look for local similarities in a set of binding pockets and then check whether the bound ligands, even if completely different, share a common fragment that can account for the presence of the structural motif. Thanks to this method we can identify structural motifs whose functional significance is explained by the presence of shared features in the interacting ligands. Conclusion The application of this method to a large dataset of binding pockets allows the identification of recurring protein motifs that bind specific ligand fragments, even in the context of molecules with a different overall structure. In addition some of these motifs are present in a high number of evolutionarily unrelated proteins.</p

ART

FunClust: a web server for the identification of structural motifs in a set of non-homologous protein structures

Author: A Henschel
A Stark
A Via
AC Wallace
AD Hill
Allegra Via
Anna Tramontano
CT Porter
F Ferre
G Ausiello
G Ausiello
Gabriele Ausiello
GK Sandve
GR Stockwell
HM Berman
KA Denessiouk
LH Greene
M Novotny
M Shatsky
M Shatsky
Manuela Helmer-Citterich
N Hulo
P Puntervoll
P. Marcatili
Paolo Marcatili
PF Gherardini
Pier Federico Gherardini
S Jones
SL Moodie
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

The occurrence of very similar structural motifs brought about by different parts of non homologous proteins is often indicative of a common function. Indeed, relatively small local structures can mediate binding to a common partner, be it a protein, a nucleic acid, a cofactor or a substrate. While it is relatively easy to identify short amino acid or nucleotide sequence motifs in a given set of proteins or genes, and many methods do exist for this purpose, much more challenging is the identification of common local substructures, especially if they are formed by non consecutive residues in the sequence

ART

Discriminative structural approaches for enzyme active-site prediction

Author: A Stark
A Stark
AC Wallace
B Colson
CS Wright
DG Kendall
EC Webb
GJ Kleywegt
JA Barker
JS Fetrow
JW Torrance
KC Chou
L Holm
M Gribskov
N Nagano
N Nagano
Nozomi Nagano
PF Gherardini
RA Laskowski
T Hastie
T Kato
T Kato
T Kato
Tsuyoshi Kato
VA Ivanisenko
Y Loewenstein
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Predicting enzyme active-sites in proteins is an important issue not only for protein sciences but also for a variety of practical applications such as drug design. Because enzyme reaction mechanisms are based on the local structures of enzyme active-sites, various template-based methods that compare local structures in proteins have been developed to date. In comparing such local sites, a simple measurement, RMSD, has been used so far. Results This paper introduces new machine learning algorithms that refine the similarity/deviation for comparison of local structures. The similarity/deviation is applied to two types of applications, single template analysis and multiple template analysis. In the single template analysis, a single template is used as a query to search proteins for active sites, whereas a protein structure is examined as a query to discover the possible active-sites using a set of templates in the multiple template analysis. Conclusions This paper experimentally illustrates that the machine learning algorithms effectively improve the similarity/deviation measurements for both the analyses.</p

CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

Author: A Andreeva
A Stark
A Stark
BW Matthews
CJ Sigrist
CT Porter
E Krissinel
ED Scheeff
G Ausiello
GJ Kleywegt
Gong-Hua Li
H Ago
HM Berman
I Boltes
IN Shindyalov
JA Barker
JA Gerlt
JC Lagarias
Jing-Fei Huang
JS Fetrow
JW Torrance
K Kinoshita
L Holm
P Chen
PF Gherardini
RA Laskowski
RD Finn
RV Spriggs
S Schmitt
SF Altschul
SF Altschul
T Fawcett
T Madej
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in Protein Data Bank (PDB), thus, the functional prediction of these proteins has become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but these methods need to be improved further, such as, enhancing the accuracy, sensitivity, and the computational speed. Here, an accurate algorithm, the CMASA (Contact MAtrix based local Structural Alignment algorithm), has been developed to predict unknown functions of proteins based on the local protein structural similarity. This algorithm has been evaluated by building a test set including 164 enzyme families, and also been compared to other methods. Results The evaluation of CMASA shows that the CMASA is highly accurate (0.96), sensitive (0.86), and fast enough to be used in the large-scale functional annotation. Comparing to both sequence-based and global structure-based methods, not only the CMASA can find remote homologous proteins, but also can find the active site convergence. Comparing to other local structure comparison-based methods, the CMASA can obtain the better performance than both FFF (a method using geometry to predict protein function) and SPASM (a local structure alignment method); and the CMASA is more sensitive than PINTS and is more accurate than JESS (both are local structure alignment methods). The CMASA was applied to annotate the enzyme catalytic sites of the non-redundant PDB, and at least 166 putative catalytic sites have been suggested, these sites can not be observed by the Catalytic Site Atlas (CSA). Conclusions The CMASA is an accurate algorithm for detecting local protein structural similarity, and it holds several advantages in predicting enzyme active sites. The CMASA can be used in large-scale enzyme active site annotation. The CMASA can be available by the mail-based server (<url>http://159.226.149.45/other1/CMASA/CMASA.htm</url>).</p

Isolation and in silico characterization of novel esterase gene with β-lactamase fold isolated from metagenome of north western Himalayas

Author: A Knietsch
A Wiseman
AC Wallace
AK Sudan
C Schmeisser
D Perez
DE Gillespie
EI Petersen
EY Yu
J Foght
J Vakhlu
J Vidya
JA Fuhrman
JD Bendtsen
JL Arpigny
K Liebeton
K Rashamuse
K Rashamuse
M Wiederstein
MA Larkin
ML Verdonk
N Eswar
N Mokoena
P Mullany
PF Gherardini
R Berlemont
R Daniel
RA Laskoswski
S Biver
T Morrohoshi
TA Binkowski
UG Wagner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Public Library of Science (PLOS)

Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies

Author: A Andreeva
AE Todd
AL Cuff
Alison L. Cuff
AU Tamuri
BE Engelhardt
BH Dessailly
C Chothia
CA Orengo
Christine A. Orengo
DA Benson
DE Almonacid
DM Schmidt
DS Tawfik
G Caetano-Anolles
GA Reeves
Gemma L. Holliday
GJ Bartlett
GJ Binford
GL Holliday
GL Holliday
GL Holliday
HS Park
I Nobeli
Ian Sillitoe
J Ruan
J Shi
Janet M. Thornton
JP Overington
K Katoh
LH Greene
M Bashton
M Groll
M Xu
ME Glasner
MT Murakami
N Furnham
N Gallastegui
Nicholas Furnham
NJ Mulder
O Khersonsky
PF Gherardini
PJ O'Brien
Roman A. Laskowski
SC Pegg
SD Brown
SF Altschul
W Heinemeyer
WS Valdar
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life

CiteSeerX

LSHTM Research Online

UCL Discovery

FigShare

The LabelHash algorithm for substructure matching

Background: There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. However, experimental protein function determination is expensive and very time-consuming. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Results: We present LabelHash, a novel algorithm for matching substructural motifs to large collections of protein structures. The algorithm consists of two phases. In the first phase the proteins are preprocessed in a fashion that allows for instant lookup of partial matches to any motif. In the second phase, partial matches for a given motif are expanded to complete matches. The general applicability of the algorithm is demonstrated with three different case studies. First, we show that we can accurately identify members of the enolase superfamily with a single motif. Next, we demonstrate how LabelHash can complement SOIPPA, an algorithm for motif identification and pairwise substructure alignment. Finally, a large collection of Catalytic Site Atlas motifs is used to benchmark the performance of the algorithm. LabelHash runs very efficiently in parallel; matching a motif against all proteins in the 95 % sequence identity filtered non-redundant Protein Data Bank typically takes no more than a few minutes. The LabelHash algorithm is available through a web server and as a suite of standalone programs a

CiteSeerX

Borrelia burgdorferi Requires the Alternative Sigma Factor RpoS for Dissemination within the Vector during Tick-to-Mammal Transmission

Author: A Battesti
AB Molofsky
CH Eggers
CH Eggers
CH Eggers
Christian H. Eggers
CJ Pappas
D Grimm
DL Cox
DS Samuels
DS Samuels
E Fikrig
E Hodzic
E Klauck
EA Rogers
F Gherardini
FC Fang
H Xu
H Xu
J Miklossy
J Ohnishi
JA Boylan
JA Hyde
JC Setubal
JD Radolf
Jenifer Coburn
JL Bono
Justin D. Radolf
K Promnares
K Tilly
M He
M Kumar
M Whiteley
Melissa J. Caimano
MJ Caimano
MJ Caimano
MJ Caimano
MJ Caimano
O Brorson
O Brorson
O Brorson
PF Policastro
PS Alban
Q Xu
R Hengge-Aronis
R Hengge-Aronis
RD Gilmore Jr
RD Gilmore Jr
RJ Pollack
RR Montgomery
S Banik
SJ Norris
SM Chiang
SM Dunham-Ems
Star M. Dunham-Ems
T Dong
TG Patton
TG Schwan
TM Gruber
U Pal
U Pal
U Pal
U Pal
V Mulay
VB Mulay
WS Rasband
XF Yang
YS Balashov
Z Ouyang
Publication venue: Public Library of Science
Publication date: 01/02/2012
Field of study

While the roles of rpoSBb and RpoS-dependent genes have been studied extensively within the mammal, the contribution of the RpoS regulon to the tick-phase of the Borrelia burgdorferi enzootic cycle has not been examined. Herein, we demonstrate that RpoS-dependent gene expression is prerequisite for the transmission of spirochetes by feeding nymphs. RpoS-deficient organisms are confined to the midgut lumen where they transform into an unusual morphotype (round bodies) during the later stages of the blood meal. We show that round body formation is rapidly reversible, and in vitro appears to be attributable, in part, to reduced levels of Coenzyme A disulfide reductase, which among other functions, provides NAD+ for glycolysis. Our data suggest that spirochetes default to an RpoS-independent program for round body formation upon sensing that the energetics for transmission are unfavorable

One origin for metallo-β-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees

Author: A Coulson
A Shimada
A Yamamura
AC Palmer
AE Todd
AM Burroughs
B Autzen
BG Hall
BG Hall
BG Hall
BG Hall
BG Hall
C Bebrone
C Bruns
C Chothia
C Lakner
CJA Sigrist
CT Porter
D Weinreich
D Xu
DA Alfredson
Daniel Barker
E Paradis
E Quevillon
EC Meng
EM Zdobnov
F Lutzoni
FC Bernstein
G Garau
GL Holliday
H Ashkenazy
I Sillitoe
IN Shindyalov
J Bergsten
J Felsenstein
J Felsenstein
J Felsenstein
J Huelsenbeck
J Lees
J Lees
J Spencer
JB Plotkin
JH Ullah
JK Hobbs
John B. O. Mitchell
JW Torrance
K Katoh
K Katoh
K Katoh
L Aravind
LA Kelley
M Galleni
M Hendy
M Pagel
M Pagel
ME Alfaro
MN Wass
N Furnham
N Furnham
N Latysheva
NC Butzin
P Lemey
P Menzel
P Oelschlaeger
PC Babbitt
PD Williams
PF Gherardini
RA Laskowski
RA Laskowski
RD Finn
RG Alderson
RJ Edwards
RM Bush
Rosanna G. Alderson
S Guindon
S Sikora
S Whelan
T Ashfield
TA Hall
TA Holton
The Uniprot Consortium
TM Keane
V Anantharaman
V Hanson-Smith
VA Risso
VA Risso
VM D’Costa
WM Fitch
Y Huang
Z Wang
Z Wang
Z Yang
Z Yang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

This work was supported by BBSRC (grant BB/F016778/1)Bacteria use metallo-β-lactamase enzymes to hydrolyse lactam rings found in many antibiotics, rendering them ineffective. Metallo-β-lactamase activity is thought to be polyphyletic, having arisen on more than one occasion within a single functionally diverse homologous superfamily. Since discovery of multiple origins of enzymatic activity conferring antibiotic resistance has broad implications for the continued clinical use of antibiotics, we test the hypothesis of polyphyly further; if lactamase function has arisen twice independently, the most recent common ancestor (MRCA) is not expected to possess lactam-hydrolysing activity. Two major problems present themselves. Firstly, even with a perfectly known phylogeny, ancestral sequence reconstruction is error prone. Secondly, the phylogeny is not known, and in fact reconstructing a single, unambiguous phylogeny for the superfamily has proven impossible. To obtain a more statistical view of the strength of evidence for or against MRCA lactamase function, we reconstructed a sample of 98 MRCAs of the metallo-β-lactamases, each based on a different tree in a bootstrap sample of reconstructed phylogenies. InterPro sequence signatures and homology modelling were then used to assess our sample of MRCAs for lactamase functionality. Only 5 % of these models conform to our criteria for metallo-β-lactamase functionality, suggesting that the ancestor was unlikely to have been a metallo-β-lactamase. On the other hand, given that ancestral proteins may have had metallo-β-lactamase functionality with variation in sequence and structural properties compared with extant enzymes, our criteria are conservative, estimating a lower bound of evidence for metallo-β-lactamase functionality but not an upper bound.Publisher PDFPeer reviewe